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ELIASHBERG'S PROOF OF CERF'S THEOREM 



HANSJORG GEIGES AND KAI ZEHMISCH 



£™s ' Abstract. Following a line of reasoning suggested by Eliashberg, we prove 

^vj , Cerf 's theorem that any diffeomorphism of the 3-sphere extends over the 4-ball. 

To this end we develop a moduli-theoretic version of Eliashberg's filling-with- 

holomorphic-discs method. 



(N 

f— i ■ 1. Introduction 

w' ' The abelian group T n of orientation preserving diffcomorphisms of the (n — 1)- 

C"| . dimensional sphere S*" -1 modulo those that extend to a diffeomorphism of the 

n-ball D n plays an important role in differential topology, cf. [IB]. By the classical 
work of Kervaire-Milnor [T7] on homotopy spheres and Smale's [5^ solution of the 
higher-dimensional Poincare conjecture, T n can be identified with the set of oriented 
smooth structures on the topological n-sphere for n > 5. The correspondence is 
given by associating with [/] e r„ the smooth structure on S n obtained by using 
the diffeomorphism / of iS n_1 to glue two copies of D n along their boundary. 

o 



It is easy to see that Ti and I^ are trivial. The result 1^ = is due independently 
to Munkres [24] and Smale [28]. The argument of Munkres is quite elementary; 
C*~) ' using the Poincare-Bendixon theorem for foliations of the plane, Smale actually 

r — , proves the stronger statement that the 3-dimensional orthogonal group is a strong 

deformation retract of the diffeomorphism group of the 2-sphere. For n > 5, the 
groups r„ are amenable to computation by the results of Kervaire-Milnor, for 
instance T 5 = T 6 = 0, and Tj is the cyclic group of order 28. 

The statement T 4 = is known as Cerf's theorem [?]. One consequence of 
this result is that there are no exotic smooth structures on S 4 that can be ob- 
tained by gluing two 4-discs. Thanks to the obstruction theory of Munkres [25] for 
smoothings of combinatorial manifolds, Cerf's theorem also implies, for instance, 
that every combinatorial n-manifold admits a smoothing for n < 7, unique up to 
diffeomorphism for n < 6. 

In [B] Eliashberg proposed an ingenious proof of Cerf's theorem based on his 
classification of contact structures on S 3 and his method of filling with holomor- 
phic discs [5]. Eliashberg-Polterovich [7] gave another topological application of 
this method; they showed that the space of local Lagrangian 2-knots in R 4 is con- 
tracture. 

Unfortunately, the survey paper [5] does not include proofs of the essential com- 
pactness and regularity properties of holomorphic discs. In the present paper we 
give a proof of Cerf's theorem along the lines suggested by Eliashberg. Rather 
than simply filling in the details, we develop an alternative approach to the filling 
with holomorphic discs in a moduli-theoretic framework, for which the technical 
foundations have been laid in the magisterial monograph of McDuff-Salamon [22] . 

l 
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Here is an outline of the paper. In Section [2] we describe the basic set-up of 
Eliashberg's proof, and then indicate where our strategy differs from Eliashberg's. 
In Section [3] we recall Eliashberg's argument why it suffices to prove the extension 
result for contactomorphisms of the standard contact structure ( on 5 3 . 

In Section [4] we introduce the moduli space of holomorphic discs on which our 
proof of Cerf 's theorem is built. The boundaries of the holomorphic discs in question 
are required to lie in a family of totally real submanifolds; in a different setting 
such varying boundary conditions have also been investigated by Wendl [3D]- As 
described in Section [5l a suitable evaluation map on this moduli space then gives 
an 'explicit' extension of a given contactomorphism of (S 3 ,^) to a diffcomorphism 
of D 4 . Sections \6\ and [7] are devoted to establishing compactness and transversality 
results for our moduli space. 

In order to ascertain that the holomorphic discs in question are embedded, we 
need a relative adjunction inequality, which in turn relies on results about positivity 
of intersection. These are demonstrated in Section [9j preceded by an account of 
the topological intersection theory of discs in Section [H 

One important word about notation: following [22], and with apologies to hy- 
perbolic geometers, we write DcC and EI C C for the closed unit disc and upper 
half-plane, respectively. 

2. Idea of the proof 

Regard the 3-sphere S 3 as the unit sphere in C 2 with complex Cartesian coor- 
dinates (zi = x\ + iyi, Z2 — X2 + iy^)- Let H be the height function on S 3 given 
by projection onto the j/2-coordinate. For t 6 (— 1, 1) the level sets S 1 * := H~ l (t) 
define a smooth foliation of S 3 \ {(0, 0, 0, ±1)} by 2-spheres. We regard the points 



<&:=(0,0,±Vl-* 2 ,*) 

as the poles of these 2-spheres. 

This family of poles, together with the two poles (0, 0,0, ±1) of S 3 , forms an 
unknot 

K:= {(0,0,±Vl -t 2 ,t): i€[-l,l]} 
in S 3 . The complement S 3 \ K is foliated by circles that bound holomorphic discs 



^:=J} 4 n(Cx{ l2 = Sl!/2 =(}), |t|<l, \s\ <Vl-t 2 . 

For each t E (— 1, 1), the circles <9D* foliate the punctured 2-sphere S l \ {q±}', we 
write £t for this foliation by circles. 

Now let ip be an orientation preserving diffeomorphism of S 3 that we wish to 
extend over D 4 . Any isotopy from ip to a diffeomorphism ip can be swept out over 
a collar neighbourhood of S 3 in D 4 , where the collar coordinate serves as isotopy 
parameter. Thus, if ip extends over D 4 , then so does (p. In other words, it suffices 
to find an extension for a suitably well-behaved representative in the isotopy class 
of ip. 

By the disc theorem |121 Theorem 8.3.1], any orientation preserving diffeomor- 
phism of an Ti-manifold is isotopic to one that fixes any given embedded n-disc. 
So we may require without loss of generality that ip fix a neighbourhood of the 
unknot K. 

Write S l := ^{S 1 ) for the images of the 2-spheres S f under ip. Then (p(£t) is 
a foliation of the punctured 2-sphere 5* \ {q±} by circles. Now the key idea is as 
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follows. Suppose we were able to find a foliation of D 4 \ K by holomorphic discs 
with these properties: 

(1) The boundaries of the discs foliate the punctured 2-spheres S f \ {q±}- 

(2) This foliation of S l \ {q±} can be deformed to ip(£t)- 

Then the proof of Cerf's theorem T4 = would reduce to T 2 — 0, or more 
precisely the parametric version of the latter, which says that the restriction map 
Diff(Z? 2 ) — > Diff (S 1 ) is a Serre fibration with contractible fibre. 

As observed by Eliashberg, this plan is feasible, provided that ip is a contacto- 
morphism of the standard contact structure £ on S 3 . Then both </?(£*) and the 
foliation of S* \ {q±} given by the boundaries of holomorphic discs are transverse 
to the characteristic foliation St , so one foliation can be deformed into the other by 
an isotopy along the characteristic foliation. Moreover, Eliashberg's classification 
of contact structures on S 3 shows that any diffcomorphism of S 3 is indeed isotopic 
to a contactomorphism of (S 3 , £) — for the convenience of the reader we reproduce 
the proof of this fact in Section [3] — , so this assumption on ip is not restrictive. 

In the present paper, Eliashberg's programme is carried out rigorously. However, 
we deviate from his scheme in one important respect. We define a moduli space 
of holomorphic discs in D 4 C C 2 whose boundaries lie on the punctured spheres 
S l \ {q±}- The extension of the contactomorphism <p over D 4 is then defined in 
terms of an evaluation map on this moduli space. That way we obtain easier control 
over the smoothness of the extension. This is true in particular near the singular 
set K, where the behaviour of the diffeomorphism is clear from the observation that 
the filling with holomorphic discs adapted to ip coincides with the standard filling 
near K. 

The reader will notice that the vanishing of T 2 is not mentioned explicitly in 
our proof. This is a consequence of our working with holomorphic discs with three 
marked points on the boundary. Such discs come with a fixed parametrisation, so 
we need no longer worry about choices of diffeomorphisms of S 1 and their extensions 
toD 2 . 

In the following subsections we describe the backcloth of our proof, mostly to 
set up notation. 

2.1. Contact geometry of S 3 . Consider the differential 1-form 

A := -(xi dyi - yx dx Y + x 2 dy 2 - y 2 dx 2 ) 

on M 4 . The 1-form a := X\ts 3 on S 3 gives rise to a volume form a A da defining 
the positive orientation of S 3 (with S 3 oriented as boundary of D 4 C C 2 ), so a is a 
contact form. Its kernel £ := kera is the (positive) standard contact structure 
on S 3 . The 2-form ui := dX is the standard symplectic form on R . 

The characteristic foliation St induced by £ on the 2-sphere S l is the singular 
1-dimcnsional foliation defined by the intersection T p S t nS, p for p€S', with singular 
points where T p S l and the contact plane £, p coincide, which happens exactly at 
p = q l ± . Notice that the unknot K made up of these singular points and the two 
poles of S 3 is transverse to £. 

A contactomorphism of £ = kera is a diffeomorphism ip of S 3 such that 
ip*a — fa for some smooth function f : S 3 —>■ M. + . 
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In Section [3] below we also have to use the notion of a tight contact structure, 
and we appeal to Gray stability of contact structures. Both concepts are explained 
in HDJ. 

2.2. Strict pseudoconvexity of S 3 . Write Jo for the standard complex bundle 
structure on TC 2 induced by multiplication with i. The 3-sphere S 3 may be re- 
garded as the level set p _1 (l) of the strictly plurisubharmonic function 

p: C 2 ^K, (z u z 2 )^\ Zl \ 2 + \z 2 \ 2 . 

This implies that the complex tangencies T p S 3 r\ Jo(T p S 3 ) define a contact structure 
on S 3 , given as the kernel of the contact form — (l/2)dpo Jolrs 3 , cf. [TQl Chapter 5]. 
As is well known (and easy to check), this contact form equals the previously 
defined a. 

The essential consequence of S 3 being strictly pseudoconvex is that non-constant 
holomorphic discs in C 2 with boundary on S 3 have their interior in the interior 
of D , see Proposition 14.21 for the precise formulation of this maximum principle. 

The characteristic foliation St being non-singular away from g^_ is equivalent to 
iS* not having any complex tangencies except at these two poles. In other words, 
S l \ {q±} is a totally real surface. 



2.3. A holomorphic filling of S 3 . For \t\ < 1 and \s\ < y/1 — t 2 define a smooth 
real- valued function 



2 V / T rr t 2 \ Vl^i 2 



For each t this defines a diffeomorphism from (— yl — t 2 , yl — t 2 ) to M. Now 
consider the parametrisations 



u*(z) := (Vl-s 2 -t 2 -e l9 ^^ -z,s,t), z£D, 

of the holomorphic discs D*. The rotation factor e 10 ^^ has been chosen in such 
a way that each leaf of the characteristic foliation St (outside qj_) is parametrised 
by a map s >-> u*(z), \s\ < vl — t 2 , for some z € 9D. For this one simply needs to 
verify that the tangent vector 9 s u* (z) to S 3 (for z € 9D) lies in the kernel of the 
1-form x\ dyi — yi dxi — tdx 2 , which is the pull-back of a to S 1 . Putting all these 
parametrisations together, we obtain a diffeomorphism 

F st : (D x Int D, <9D x Int D) — ► (/J 4 \ K, S 3 \ K) 
{z,s,t) i — > ul(z). 

We can extend this map continuously over the boundary D x dH) by sending (z, s, t) 
with |t| < 1 ands = ±y/l - t 2 to q±, and (z,0,±l) to (0, 0,0, ±1). So the boundary 
is mapped onto K. 

This map F st will be our prototype of a holomorphic filling. The formal definition 
of such fillings will be given in Section [5j 

2.4. Symplectic energy. The symplectic energy of the holomorphic disc u* is 
defined as E(u l a ) — J D (u*)*w. One computes 

E{u\) = tt(1 - s 2 - t 2 ) < tt(1 - t 2 ), 

i.e. for each t we have a uniform bound on the energy. A general result to this effect 
will be proved in Proposition ^. 71 
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Writing z in polar coordinates as z — re 10 , we compute 

a^deul (z)) = (1 - s 2 - t 2 )/2 for z e dB. 

This gives another way to verify the computation of the energy, since L,(u*)*oj = 
J aD (M*)*a by Stokes's theorem. 

Since a evaluates positively on dgul(z) for z e <9ID>, the curve ul\do is posi- 
tively transverse to the characteristic foliation St. It is another consequence of 
the maximum principle that this holds true for arbitrary holomorphic discs, see 
Proposition 14.21 



2.5. Bundle pairs and Maslov index. The holomorphic disc u* has boundary 
on the totally real submanifold 5* \ {q±} of C 2 . So it defines a bundle pair in the 
sense of [HI Definition C.3.4], that is, a complex vector bundle (u*)*TC 2 over D 
(here: the trivial C 2 -bundle) and a totally real sub-bundle (w* IodTTS* over the 
boundary <9B. At the point u^e 10 ) € dDy, the fibre of this totally real subbundle 
is given by Kie © R. From the axiomatic definition of the Maslov index fi given 
in [H Theorem C.3.5] it follows that /i(it*) = 2 for all £ e (-1, 1) and |s| < \/l- t 2 . 

2.6. The standard neighbourhood of K. To avoid problems at the singular 
points g^_ , we design a set-up where the holomorphic filling of D corresponding to 
some contactomorphism of S 3 coincides with the standard filling F st near K . For 
8 € (0, 1), we define a neighbourhood of K by 

U s := F st (S 1 x {1 - S < s 2 +t 2 < 1}) C S 3 . 

The boundary 9W is a Lagrangian torus with a holomorphic filling given by the 
restriction of F st to D x {s 2 + 1 2 = 1 — S}. We shall require the contactomorphism 
of (S 3 , £) to act as the identity on some U s . 

2.7. The 2-discs Qk- In order to obtain a compact moduli space of holomorphic 
discs, one needs to quotient out the 3-dimensional automorphism group of ID or, 
alternatively, place a restriction on three marked points in D. We shall take the 
latter approach, using the three points i k € <9B, k = 0, 1, 2 as markers. 

We define three open 2-discs in S 3 by 

Q k :=F st ({i k }xlntB)cS 3 . 



These discs are transverse to the punctured 2-spheres <S"\{<7j_}, and the intersection 
is given by one leaf l\ of the characteristic foliation St ■ 

Notice that a contactomorphism <p of (S 3 , £) will map the characteristic foliation 
St to the characteristic foliation of the image sphere S* = ^(/S*). The discs Qk will 
be used in Lemma 17.21 to show that, as expected, imposing the condition that our 
holomorphic discs map i k into the leaf t\. :— ip(£j.) will cut down the dimensions of 
the moduli space by 3. 

We also consider the smaller closed disc 

Q s := F st ({l} x {s 2 +t 2 <1-5})C Q Q . 

In Proposition 15.11 we shall set up a diffeomorphism between the disc <p{Q 5 ) and 
the moduli space of holomorphic discs adapted to tp, with three marked points and 
boundary outside the neighbourhood U s . 
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3. Reduction to contact geometry 

In order to construct an extension of a given diffeomorphism of S 3 to a diffeo- 
morphism of D 4 , we first want to isotope it to a diffeomorphism adapted to the 
set-up of the previous section. 

Proposition 3.1. Any orientation preserving diffeomorphism of S 3 is isotopic to 
a contactomorphism of (S 3 ,£,) that fixes an open neighbourhood U of K pointwise. 

Proof. Let ipo be a given orientation preserving diffeomorphism of S 3 . Then Tip(£) 
is a positive tight contact structure on the 3-sphere. By the uniqueness of such 
structures up to isotopy, and with Gray stability, we can isotope ipo to a contacto- 
morphism ipi of £, cf. |10| Lemma 4. II. I]. 
There is a contactomorphism 

{R 3 ,ker(dw + udv-vdu)) — > (S 3 \ {(0, 1,0, 0)},£) 

which sends the unit circle C in the ww-plane to K. An explicit description of such 
a contactomorphism can be found in [10[ Proposition 2.1.8]; simply compose the 
contactomorphism 

(R 3 ,ker(dw + udv-vdu)) — > (S 3 \ {(0,0,0, 1)},£) 

described there with the contactomorphism of (S 3 , £) induced by (zi, z-i) i-> (22, Z\). 

This contactomorphism restricts to a contact embedding j of a ball B of radius 2, 
say, in (R 3 ,ker(dw + udv — vdu)) into (S 3 ,!;), sending C to K. A second contact 
embedding of this ball is given by <p\oj. By the contact disc theorem [TQl Theorem 
2.6.7 and Remark 2.6.8], there is a contact isotopy whose time-1 map i\)\ is a 
contactomorphism of (S 3 , £) such that ipx o ipi o j = j. 

So our initial diffeomorphism (fo is isotopic to the contactomorphism ipx o tpi, 
which fixes the neighbourhood U := j(int B). D 

The isotopy from ipo to tp\ o ipi can be swept out over a collar neighbourhood of 
S 3 = dD 4 in D 4 . Hence, Cerf's theorem will follow if we can find an extension of 
■01 o ipi to a diffeomorphism of D 4 . 

So from now on we consider a contactomorphism ip of (S 3 , £) that fixes an open 

neighbourhood U of K pointwise, and we choose 8 £ (0, 1) such that the closure U 
of the standard neighbourhood U b is contained in U. 

4. Bishop discs 

In Section [2] we described a simple filling of the 3-sphere by holomorphic discs, 
i.e. a foliation of D 4 \ K by discs with boundary in S 3 \ K. We now want to 
construct another such filling, one that is related to the contactomorphism <p. 

We begin by introducing the corresponding boundary value problem. For fixed 
t E (— 1, 1) we are looking for smooth (up to the boundary) solutions 

u *: (B, flD) —►(!>*, 5* \{e&}) 
of the homogeneous Cauchy-Riemann equation du l = (in other words: holo- 
morphic discs) having boundary values in S* := <p(S ). We remarked before that 
the punctured 2-spheres S" \ {q±} are totally real submanifolds of C 2 . Hence, 
so are their images £r \ {q±} under ip, since tp preserves the contact structure 
£ = TS 3 nJ (TS 3 ). 
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In addition, we require the relative homotopy class [u*] G ^(Z? 4 ,^* \ {q±}) to 
equal A 1 , which we define as the class in ^(-D 4 , S 1 * \ {g±}) that maps to the class 
of [(po Uq|qd] G ^1(5' \ {g±}) under the boundary homomorphism <9». (This notion 
of relative homotopy class will be relevant for our discussion of Maslov indices, and 
our more general set-up in Section [9]) Moreover, we fix three marked points by 
imposing the condition u*(i fc ) G P k , where it := <p(£V), k = 0, 1, 2, are three leaves 
of the characteristic foliation St = ip(Sf). Such a holomorphic disc will be called a 
t-level Bishop disc for ip. The collection 

W v := {u : t 6 (— 1, 1), u is a i-level Bishop disc for y>} 

of all such discs is the moduli space of Bishop discs. We write M. v {t) C W v 
for the moduli space of i-level Bishop discs for ip. 

Section[2]can be read as a description of the structure of Wia- We are now aiming 
for a similar description of W v for any contactomorphism tp as in Proposition 13.11 

In the following propositions we collect some relevant facts about our Bishop 
discs. 

Proposition 4.1. Every Bishop discs for ip has Maslov index 2, that is, n(A ) = 2 
for allte (-1,1). 

Proof. Recall from Section l2~5l that that the Maslov index of u equals 2 for all t G 
(— 1, 1). Since the boundary homomorphism 9» : ^(-D 4 , S* \ {q±}) —> ■K\{S t \{q t i: }) 
is an isomorphism, and the Maslov index is invariant under homotopies, it suffices 
to show that <9* A* — 9*[wg]- 

For a given t choose s E (—1,1) with 1 — <5 — i 2 < s 2 < 1 — t 2 . Then, since ip = id 
on li s , we have \ip o u* |an] = [u*|9d]- Hence, we obtain the following sequence of 
equalities, where the first one holds by definition of the class A 1 . 

<%A* = [(poullan} = [tpoullen] = [u*|a D ] = Klan] = d*[u ]. 
This proves the proposition. D 

In the sequel we have to appeal several times to the maximum principle for 
holomorphic discs in C 2 with boundary on S s . We briefly recall the statement in 
the form needed for our purposes. This maximum principle also explains why we 
could define our Bishop disc from the outset as maps into the 4-ball D 4 rather 
than C 2 . 

Proposition 4.2 (Maximum principle). Let S C S 3 be a totally real surface in C 2 , 
so that the characteristic foliation E^ is non-singular. Letu: (B, 9D) — > (C 2 ,S) be 
a non-constant holomorphic disc. Then u maps the interior o/D to the interior of 
the 4-ball D 4 , and u\ga is an immersion positively transverse to Sj. 

Proof. A straightforward computation, cf. [TUJ Lemma 4.11.3], gives 

A(po u) — 2d\(u x , Jqu x ) > 0. 

For the definition of A and p see Sections 12. II and [2. 21 respectively. Then the strong 
maximum principle of E. Hopf, cf. [261 Section 2.3], shows that pou = \\u\\ 2 cannot 
attain a maximum in the interior of D. The boundary point lemma {ibid.) implies 
that u is transverse to S 3, along the boundary. Since the characteristic foliation 
S^ is given by the intersection of the complex tangencies £ to S 3 with S, and u is 
holomorphic, this forces u|an to be an immersion transverse to Sj. The statement 
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about this immersion being positively transverse to £5 follows because the complex 
structure Jo sends the outer normal of S 3 C C 2 to a vector positively transverse to 
£ = kera. □ 

Remark. Suppose M is a compact level set M = /0 _1 (1) of a smooth function 
p: W — > R on an almost complex manifold (W,J), where 1 and nearby values 
are regular for p; any compact orientable codimension 1 submanifold of W can be 
realised in this way. The level set M is called J-convex if the complex tangen- 
cies to M define a positive contact structure, that is, if — dp o J\tm is a contact 
form inducing the natural orientation of M. One then computes that for a smooth 
function / : R — > R with /" ^> /' > the composition / o p is strictly plurisubhar- 
monic, i.e. —d(d(f o p) o J)(X, JX) > for all nonzero tangent vectors X £ TW 
in a neighbourhood of M, From this observation one concludes that the maximum 
principle holds, mutatis mutandis, for J-convex hypersurfaces. 

Following Lazzarini [20], cf. [22] Appendix E], we say a holomorphic disc u is 
simple if there are no two disjoint non-empty open subsets U,FcD such that 
u(U) =u{V). 

Proposition 4.3. All Bishop discs are simple. 

Proof. The maximum principle tells us that u\qb'- <9B — >■ S 1 * \ {g±} is an immersion 
transverse to the characteristic foliation St. By the homotopical condition on u\q-b 
and the nature of the characteristic foliation Si, the restriction of u to 5B must 
then in fact be an embedding. 

From this observation one concludes that u must be simple; since the argu- 
ment for this last step is of interest elsewhere, we formulate it (in slightly greater 
generality) as a separate lemma. □ 

Lemma 4.4. Let (W, J) be an almost complex manifold. If u: B —¥ W is a J- 
holomorphic disc such that u\qq is an embedding and no interior point of B is 
mapped to u(<9B), then u is simple. 

Remark. In the case of our Bishop discs, u\do is an embedding into the strictly 
pseudoconvex boundary S 3 of D 4 ; the condition on the interior points of ID is then 
guaranteed by the maximum principle. 

Proof of Lemma \4-4\ Arguing by contradiction, we assume that there are two dis- 
joint non-empty open subsets U, V C B with u(U) = u(V). By the conditions of 
the lemma, U and V are disjoint from 9B. In the proof of J22J Lemma 2.4.1] it is 
shown that critical points of u in the interior of B are isolated. Since u|gn is an 
embedding, critical points cannot accumulate near <9B, so there are only finitely 
many of them. Choose a non-critical point z* £ U and a half-open line segment 
[z*,z**) from z* to a point z** £ <9B, disjoint from the critical points of u. Let 
P C [z*, z**) be the set of points that have a neighbourhood in [z* , z**) consisting 
of non-injective points for u, that is, points z for which there is a different point 
z' e B with u(z) — u(z'). By definition P is open in [z*,z**), and z* £ P, so P is 
non-empty. 

We claim that P is also closed in [z*, z**). Indeed, suppose that (z v ) is a sequence 
of points in P converging to some point zq £ [z*,z**). Then there are points 
w v =/= z v with u(w u ) = u(z v ). By passing to a subsequence we may assume that 
the sequence (w v ) converges to a point wq £ B. 
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Since the point zq is not critical for u, the map u is locally injective near zq . This 
implies that wo ^ zo, and without loss of generality we may assume that w v 7^ zo 
for all v. Moreover, the assumptions of the lemma imply that wq is an interior 
point of ID. 

With this information we are precisely in the situation of [22l Lemma 2.4.3], 
which tells us that there exists a holomorphic map 0, defined in a neighbourhood 
of zq, such that <fr(za) — wq and u = u o <f>. So zq has a neighbourhood of non- 
injective points, which means that zq € P, i.e. P is closed in [z*,z**). 

We conclude that P = [z* , z**). In particular, we find a sequence of non-injective 
points, which we write again as (z v ), accumulating at z**. So as before we have 
points w v 7^ z v with u(w u ) — u{z u ), with (w v ) converging to some point wq. Then 
u(wo) — u(z**), which implies wo — z**. But u is locally injective near z**, so it 
is impossible for the two sequences (z„) and (w v ) of corresponding non-injective 
points to accumulate at z**. This contradiction proves the lemma. □ 

Remark. Instead of asking for m|od to be an embedding, it suffices to require that 
u\qo be injective. Then the set of critical points is still finite, see [22 Theorem 3.5], 
and the proof above goes through verbatim. 

Proposition 4.5. All Bishop discs are embedded and mutually disjoint. 

Proof. Since all Bishop discs are simple, the results from Section [9] below on the 
positivity of intersections apply. In that section we define a so-called embedding 
defect D for holomorphic discs. This embedding defect depends only on the relative 
homotopy class A 1 = [1/] G ^(D 4 , S* \ {q±}), and it equals zero if and only if u l is 
embedded (Theorem 19. 41) . 

For each t e (—1,1), the discs u\ with s in the range given by 1 — 5 — t 2 < 
s 2 < 1 — t 2 belong to our moduli space A4 V (£) of t-lcvcl Bishop discs. In particular, 
M. v (t) contains at least two disjoint embedded holomorphic discs. It follows that 
D(A t ) — and that all Bishop discs are embedded. 

Given two Bishop discs of distinct levels, their boundary curves are disjoint. 
Then the intersection number of the two discs equals the linking number of their 
boundary curves in S 3 , which is zero. By positivity of intersections (at interior 
points), see [23l Theorem 7.1], it follows that the two discs must be disjoint. 

It remains to show that the same holds true for Bishop discs at one and the same 
level t. From the defining equation of the embedding defect in Section [9l and with 
^(A 1 ) = 2, it follows that the self-intersection number A 1 • A f , as defined in Sec- 
tion [51 equals zero for every t. (Alternatively, this is a consequence of the existence 
of two disjoint t-level Bishop discs.) With positivity of intersections (Theorem 19. 2[) 
we infer that any two i-level Bishop discs are either disjoint, or their images coin- 
cide. Since we prescribed the images of three marked points on each Bishop disc, 
two Bishop discs with the same image are actually identical. □ 

Corollary 4.6. // the boundary of a t-level Bishop disc u hits the set U , then 
u = u\ for some s with 1 — 5 — t 2 < s 2 < 1 — t 2 . 

Proof. The contactomorphism ip is the identity on U, which contains U . That 
latter set is foliated by the boundary circles of standard discs u\ . Now invoke the 
preceding proposition. □ 
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It is therefore opportune to restrict attention to the truncated moduli space 



W s v := {u*: t e [-VT=8, VT^}, 

u l is a t- level Bishop disc for ip such that u'(9D) C5 3 \W S }. 

In Sections [6] and [7] we shall prove that this moduli space is a compact manifold 
with boundary. Working with this truncated moduli space allows us to circumvent 
all subtleties along the singular set K. 

The following uniform energy estimate will be one ingredient in that compactness 
argument. 

Proposition 4.7. The symplectic energy of all Bishop discs for p is uniformly 
bounded by a constant depending only on p. 



Proof. By Proposition 14.51 any Bishop disc u is an embedding. So with Stokes's 
theorem and the transformation formula, the energy E(u) can be computed as 



E(u) = I u*uj = I a = I p*a. 

JD Ju(dD) J v - 1 ou(dO) 

The contactomorphism (p pulls back the contact form a to fa, where /: S 3 — > 
R + is a smooth function bounded above by some constant C = C(p). By the 
maximum principle, u\go is positively transverse to £ = kera, and the same holds 
for tp^ 1 o u\ao. It follows that 



E(u) <C I 

Jin — 1 Oti 



The 2-sphere S 1 , which contains the embedded circle p x o u(dH>), is naturally 
oriented by the area form 

: (xi dyi A dx2 + j/i dx-i A dx\ + X2 dx\ A dyij ; 



the total area of 5* equals 47r(l — t 2 ). Let D l be the 2-disc in 5' (with the induced 
orientation) whose oriented boundary equals p~ x o u(dU>). Then 

E{u) <C I a = C I u. 

JdD* JD t 

On TS we have x\ dx\ + y\ dy\ + X2 dx2 — 0. Using this, one finds that 
x 2 <r t = \Jl - t 2 dxx A dyi = yj\-t 2 uj on TS 1 . 



Hence, with X2 < vl t 2 we conclude 

E(u) < ° [ X2<J l <C [ o* = 47rC(l - t 2 ) < AttC. 
V 1 - t 2 Jd* Js* 

This is the desired uniform estimate. □ 

Remark. The symplectic energy E(u) of a holomorphic disc equals its Dirichlet 

energy (1/2) J D |Vw| 2 see [HI Section 2.2]. Since the symplectic energy is, by its 
definition, invariant under reparametrisations, it follows that the Dirichlet energy 
is invariant under conformal reparametrisations. 
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5. From the filling to the extension 

By taking the results on compactness and transversality from Sections [6] and [JJ 
for granted, we can now determine the truncated moduli space. 

Recall from Section 12.71 that Q 5 is a closed 2-disc in S* 3 defined by Q s = 
{w|(l): (s,t) G Hi-*}, where Di_«5 denotes the closed disc {s 2 + t 2 < 1 - §}. 
Hence, if u is a Bishop disc for ip, our condition on the marked points implies 

u(i) e v{Q 5 )- 



Proposition 5.1. The evaluation map 



evi: Wi — > ^(Q 5 ) 
-* «(1) 



is a diffeomorphism. 

This proposition will be proved at the end of Section [71 once we have established 
the relevant compactness and transversality results. 

We now want to define a notion of holomorphic filling of S* 3 (with singular set K) 
with respect to a contactomorphism ip of (S 3 ,£) that subsumes, for the identity 
map, the standard filling F s t. In the sequel, the contactomorphism ip is always 
taken as a given and will be omitted from the notation. 

Definition. A holomorphic filling of S 3 is a diffeomorphism 

F: (D x kit B, <9B x Int D) — > (D A \K,S 3 \K) 
with the following properties: 

(1) 8F(.,s,t)=0ioi all (s, t) € Int B; 

(2) F extends continuously to a map (still denoted F) defined on B x B such 
that F(B x 9B) = if; 

(3) for all (z, s, t) e 9B x IntB we have F(z, s, t) <E 5* \ {?±}, and as s tends 
to ±vl — i 2 , the limit of -F(z, s j *) equals q!j_ for each z G dB. 

Observe that for t 6 (— 1, 1), the punctured 2-sphere S* \ {<7±} will be foliated 
by the boundary circles F(dU>, s, t) of the holomorphic discs F(B, s, t), where \s\ < 

We now want to construct such a holomorphic filling with the help of the mod- 
uli space W£. The map described in the next proposition is defined thanks to 
Proposition 15. II 

Proposition 5.2. The map 

F s : (B x Bi_4, dB x Bi_ 5 ) — ► (L> 4 , S 13 ) 



(2,*,t)i— > (ev 1 1 o( P ou*(l))(z) 



defined by 

is an embedding. 

Again, the proof of this proposition relies on compactness and transversality 
statements and will be given at the end of Section [Jj 

For (s,t) in a neighbourhood of dUi-s C Bi_a we have F s (z,s,t) = w'(z) = 
F B t(z,s,t). Therefore we obtain a holomorphic filling by setting 

F= \F 5 on DxD M , 

\F st on Dx{l-Ks 2 + ( 2 <1}. 
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By construction, the maps Fst and F restrict to diffcomorphisms 

/, / st : (S 1 x ©!_«, S 1 x dB^ s ) — > (S 3 \ W*, c«A 

Observe that 

/o/- 1 : S 3 \W a — >S 3 \W 4 

is a diffeomorphism equal to the identity near the boundary of S 3 \ U . By slight 
abuse of notation, we regard /° / s 7 as a diffeomorphism of S 3 , equal to the identity 

in a neighbourhood of U . 

Lemma 5.3. The two diffeomorphisms f o f~ and ip of S 3 are isotopic relative 
to a neighbourhood ofU . 

Before proving this lemma, we show that Cerf's theorem is now an immediate 
consequence. 

Proof of Cerf's theorem T^ — 0. Let V s be the open subset of D 4 defined by 

V s := F st (D x {1 - 6 < s 2 + 1 2 < 1}). 

Notice that V s n S 3 = U & . Define G: D 4 -)• D 4 by 

on D 4 \ V s , 




on V s . 

This is a diffeomorphism of D 4 that restricts to / o f^ 1 on S 3 . 

Now an extension of the diffeomorphism ip: S 3 — > S 3 to D 4 is defined by sweep- 
ing out the isotopy to / o f^ over a collar neighbourhood of S 3 in D 4 , and then 
extending over the remaining 4-ball as the diffeomorphism G. □ 

Proof of Lemma \5.3[ Consider the diffeomorphism x of 5 1 x His defined as the 
composition \ — f^ t 1 otp~ 1 of. For ease of notation we identify S 1 with R/27rZ and 
write x — x{@i s ^)- This diffeomorphism \ equals the identity near the boundary. 
Our condition on the marked point 1 E S 1 = <91D> (corresponding to 8 = 0) translates 
into saying that x(0,s,t) = (0, s, t) for all (s,t) € ID>i_«. 

Write the components of \ as X — (x 1 ^ 2 )^ 3 )- The i-level Bishop discs for ip 
have boundary in S f = y(5'*). This implies x 3 (#> s > *) = i- 

The map / st sends the s-curves to the leaves of the characteristic foliation St, 

hence ipo f st sends those curves to the leaves of Si. On the other hand, the 0-curves 
are mapped by / to curves positively transverse to the leaves of Si , thanks to the 
maximum principle. We conclude dex 1 > 0- This implies that, for each fixed t, the 
images of the 0-curves under x & re graphs of functions s = s(9). 

The idea mentioned in Section [2] of isotoping the boundaries of the Bishop discs 
for if to the images under <p of the boundaries of the standard Bishop discs translates 
into the isotopy 

(0,*,t)>— ►(X 1 ,(1-<0X 2 +*M), o-€[0,l]. 

The condition dex 1 > an d the resulting fact that x niaps each 0-curve to a graph 
guarantee that this is indeed an isotopy, stationary near the boundary; thanks to 
this boundary behaviour it is enough to observe that for each a £ [0, f] the given 
map constitutes an injective immersion. 
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For each fixed s, t we may regard the function 9 1-4 x 1 (0, s,i) as a strictly in- 
creasing function [0, 2tt] — > [0, 2tt] sending both and 2tt to itself. With this 
interpretation, an isotopy from the map above for a = 1 to the identity is given by 

(0, s, t) .— > ((1 - r)* 1 + t9, s,t), re [0, 1]. 

Again, this isotopy is stationary near the boundary. 

In conclusion, we have found an isotopy from \ to the identity, stationary near 
the boundary, which is equivalent to having the isotopy claimed in the lemma. □ 

6. Compactness 

In the present section we wish to show that the truncated moduli space W* is 
compact; in the next section we find that it is a manifold with boundary by proving 
surjectivity of the relevant linearised Cauchy-Riemann operator. For both these 
analytical questions it is convenient and customary to work not with smooth maps, 
but with the space W 1,p , for some p > 2, of (equivalence classes of) L p -functions 
whose first partial derivatives in the weak sense exist and are p-integrable. 

This approach is justified by elliptic regularity, see [22l Appendix B] : (i) a holo- 
morphic curve of class W x,v will actually be smooth up to the boundary; (ii) a 
sequence of holomorphic curves that converges in the W^'^-norm also converges 
locally uniformly with all its derivatives. 

Thus, we now equip the moduli space W v with the topology induced by the 
W rl ' p -norm, p > 2, on maps D — > D 4 . In the Banach space W /1 ' P (ID), IR 4 ) we then 
have tools such as the implicit function theorem at our disposal. 

Let (u l/ ) 1 with u v of level t„, be a sequence in the truncated moduli space W* 
converging to u <G M^ 1,P (D, D 4 ). By passing to a subsequence we may assume that 
the sequence of levels (ty) converges to some io € [— \/l — 8, y/1 — 6]. By what we 
just said, (w„) converges with all derivatives to uq. Thus, uq is again a holomorphic 
disc. Moreover, the boundary circles u v {dl$) C S tu \ {q±} are homotopically non- 
trivial and stay outside the neighbourhood U s of the poles q±. So the same will be 
true for uo- In other words, Wt, is a closed subset of W 1,p (Jb, D 4 ). 

Our proof of the following proposition uses methods from |14j , cf. in particular 
the proof of Proposition 3.15 in that paper. 

Proposition 6.1. The truncated moduli space W* is compact. 

Proof. Let (ii„) be a sequence in W^, where u v is of level t v . After passing to 
a subsequence we may assume that t v — > t e [— yT — 5,\/\ — 5}. By (22j Theo- 
rem B.4.2], in order to prove compactness (i.e. to find a converging subsequence 
with respect to the T^ 1,p -norm), we need to establish a uniform L p -bound for the 
sequence (IVm^I). We claim that the sequence (|Vm„|) is uniformly bounded even 
in the suprcmum norm on the closed disc D. 

Arguing by contradiction, assume that such a uniform bound does not exist. 
We can then find a sequence of points z v — > z in IB) such that IVw,,^)! — > 00. 
The classical convergence theorems of Montel and Weierstrafi preclude this at in- 
terior points: the maximum principle provides us with a C°-bound on (u u ) needed 
for Montel's theorem, which then guarantees the existence of a locally uniformly 
convergent subsequence on Int ID); the theorem of Weierstrafi tells us that the limit 
function u is holomorphic and \Iu v {z v ) — > Vm(zo), which means that this sequence 
of gradients remains bounded. 
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So necessarily zq £ <9B. Choose a conformal map from the closed upper half- 
plane H C C to D \ {— zo} that sends to zq. (This extends to a conformal map 
from H U {00} cC:=CU {00} to ID.) The differential of this map is bounded from 
above and below near £ H, so by precomposing with this conformal map we may 
regard the u v , by slight abuse of notation, as maps 

u v : (M,R)^(D\S u \{q^}), 

and the sequence (z u ) as a sequence in H converging to 0, still satisfying 

R v := \Vu v (z v )\ -t 00. 

Notice that by Proposition ^ . 71 and the remark following it we have a uniform bound 
on the Dirichlet energy, i.e. there is a constant C such that 

|VuJ 2 < C for all v £ N. 



2 Ji 

Choose a sequence e v \ such that e^i?,, — > 00, e.g. e v = l/hxR v . Hofer's 
Lemma [16| Lemma 6.4.5], applied to the continuous function z t— > \Vu„(z)\, allows 
us to modify the sequences (z„) and (e„) in such a way that, in addition to the 
previous conditions, we have the uniform estimates 

|Vu„(z)| < 2R V for all z £M with \z - z„\ < s v . 

After passing to a further subsequence, we may assume that 

R v dist(z„,cffl) -> r for some r g [0, 00]. 

When we write z v — x v + iy v , this reads as R v y v — > r. We shall deal separately 
with two cases: either r < 00 or r = 00. 

First case: r < 00. (i) In a first step we are going to show that a rescaled 
subsequence of (u v ) converges to a holomorphic disc. In a second step we then 
prove that this limit disc has contradictory properties. Replace the sequence (u v ) 
by the rescaled sequence (w v ), defined by 

w v (z) •— u v {x v + z/R v ). 

Since this amounts to a conformal change of parameters, the Dirichlet energy of 
the w v is still bounded by C. The uniform gradient estimate now becomes, with 

\Vw y (z)\ < 2 for all z £ H with \z - £,| < e v R v . 

Notice that (, v — > ir and IVwuiCu)] = 1< 

We wish to apply [22j Theorem B.4.2] in order to extract a Cj^ c -converging 
subsequence of (w u ). That theorem allows us to have varying almost complex 
structures, but requires a fixed boundary condition. Therefore we modify our set- 
up as follows. Choose a sequence (^ v ) of diffcomorphisms of C 2 with the following 
properties: 

(1) ^y equals the identity outside a ball of radius 2, 

(2) *„ -> id in the C°°-topology, 

(3) *„(5*-) = £*° and * v (^) = gi°. 

Such ^v can be constructed by suitably cutting off the gradient flow of the function 
H o ip^ 1 on S 3 , where H is the height function defined in Section [2] 
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Set w v :— ^ u o w v and J„ := T^ v o J o T^l~ \ so that J v — > Jo. Thus, we now 
have a sequence (w v ) of J„-holomorphic maps 

w v : (M,R)^(D\S t0 \{q^}). 

The u)„ are still subject to the same estimates as the io„, possibly after replacing 
the relevant constants by larger ones depending only on the sequence (^ v ). 

Now [HI Theorem B.4.2] does indeed permit us to select a subsequence of (w v ) 
converging in C££, to a non-constant Jo-holomorphic map 

w: (M,R)^(D\S t0 \{q^}). 

The cited theorem requires the boundary condition to be given by a closed totally 
real submanifold. No problems arise from our working with the punctured 2-sphere 
S to \ {q± }, since we know a priori that w v sends the boundary circle <9H U {00} to 
S f "\U s . 

The Cj^-convergence ensures that the energy of the limit w is estimated from 
above by C. By removal of singularities, see [2Ql Appendix B] and [2TJ Theo- 
rem 4.7.3], w extends to a holomorphic map defined on HU {00}. By reversing our 
change of conformal parameters, we regard this again as a map 

w: (D,0D) — ■» (D 4 ,S to \ {q^}), 

i.e. we have an honest holomorphic disc. 

(ii) We now want to show that such a holomorphic disc w cannot possibly exist. 



As we saw in the proof of Proposition ^. 3[ the map u\dn is an embedding transverse 
to the characteristic foliation Sl° . 

The leaf space of the characteristic foliation Si, for each t € (— 1, 1) and with the 
singular points q± removed, can be identified with S 1 , where 9 € S 1 corresponds 
to the leaf s h> ip o F st (e l9 , s, t). Let a be the homogeneous measure on S 1 of total 
measure 1, say. 

We now regard u\eo as a map from <9ED to the leaf space S 1 of Sc° ; the maps u u \qm 
and iiv|,9H will be viewed similarly. Choose a segment I C S 1 \ {— zq\ containing zq 
such that <t(u(I)) > 15/16. Under the identification of 9D\{— zq} with dM we take 
/of the form/ = [-R,R]. 

Because of the C^-convergence we have a{w v {I)) > 7/8 for v sufficiently large. 
With I v := [x v - R/R u , x v + R/R u ] this means that a{u v (I u )) > 7/8. The length of 
the interval I v tends to zero, so at least one of the three intervals in <9HU{oo} = 3D 
between the three marked points 1, i, —1 will be disjoint from I v for v sufficiently 
large; call this interval /'. The condition on marked points of our holomorphic 
discs implies that a{u u {I')) = cr{I'), which equals 1/4 or 1/2. Hence a{u l/ (I)) + 
<j(uy{I')) > 1, contradicting the fact that u u (I) na„(I') =0. 

This contradiction completes the proof in the first case. 

Second case: r = 00. In this case we define the rescaled sequence (w v ) by 

w u (z) := u v (z v -(- zjR v ) for z — x + iy with y > —y v R v . 

Then |Vuv(0)| = 1, the Dirichlet energy of the w v is bounded by C, and we have 
the uniform estimate 

\Vw v (z)\ < 2 for all zeH with \z - z v \ < e v R v . 
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Again we may appeal to [22[ Theorem B.4.2]; this now gives us a subsequence of 
(«;„) that converges in C^ c to a holomorphic map w: C — > C 2 , which is bounded 
by 1 and non-constant because of |Vw(0)| = 1. By the classical Hebbarkeitssatz of 
Riemann, now applied to a conformal chart of C = C U {oo} near oo, we obtain a 
non-constant holomorphic sphere w : S 2 —t C 2 , in contradiction to the maximum 
principle. D 

Remark. With a view towards potential generalisations of this result it is op- 
portune to remark that the references to classical theorems of complex analysis 
can be substituted by results that hold for almost complex structures tamed by a 
symplectic form. 

For proving that (|Vw„|) is uniformly bounded one may replace the reference 
to the convergence theorems of Montel and Weierstrafi by arguments from [22j 
Section 4.2]. For z v — > zq E IntO and |Vu„(z„)| — > oo these arguments would allow 
one to infer the existence of a bubble, i.e. a non-constant holomorphic sphere in C 2 , 
in violation of the maximum principle. 

In place of Riemann's Hebbarkeitssatz one may cite |221 Theorem 4.1.2]. 

7. Transversality 
Here is the main result of the present section: 

Proposition 7.1. The truncated moduli space of Bishop discs W°, is a 2-dimensio- 
nal manifold with boundary dWt, — {u\ : s 2 + t 2 = 1 — 6}. 

For the proof of this proposition we shall establish automatic transversality re- 
sults as in [11] and [15]; the attribute 'automatic' refers to the fact that no per- 
turbation of the (almost) complex structure is required to guarantee surjectivity of 
the Fredholm operator in question. 

We initially drop the condition on the three marked points. Thus, write W v 
for the free moduli space of all i-level Bishop discs for ip, where t varies in the 
interval (— 1, 1), but now without any restriction on the image of the points 1, i, —1. 
Our aim will be to show that W v is a 5-dimensional manifold. Once this has been 
achieved, the following simple lemma allows us to deduce that our original moduli 
space W v is a 2-dimensional manifold. 

The tangent space T u W hp (B,R 4 ) to the Banach space W 1,P (B,R 4 ) at some 
point u of that Banach space can of course be naturally identified with the Banach 
space itself. When the target space is not a linear space, but some manifold W, the 
tangent space T U W^ P {B,W) to the Banach manifold W 1,P (B,W) is the space of 
W 1 ' p -sections of the pullback bundle u*TW ', cf. [12 Chapter 3] or Section 6.2]; 
in other words, a tangent vector 77 at u is a map r\: B — > TK 4 with r\{z) € T U ( Z )R 4 . 
We also want to consider Banach manifolds of relative maps (B, 9B) — > (R 4 ,E), 
where £ C S 3 is a totally real submanifold. Then tangent vectors are sections of the 
bundle pair (cf. Section[2|) u*(TM 4 , TE). The details of this relative case are worked 
out in [32, Section 3.1] . The key point here is to choose an auxiliary metric for which 
£ is totally geodesic in M 4 . Then the composition of a section of u*(TM. 4 , TE) with 
the exponential map for this metric will be a map (B, 9B) — > (M 4 , E). 

In particular, if m T , r € (— e,e), is a holomorphic reparametrisation of B, with 
mo = idn, we may regard i] := (d/ 'dr)\ T= o(u o m T ) as a tangent vector in this sense; 
here rj(z) is actually tangent to u(B) at u(z); and for z € 9B, tangent to u(dU>). 

See Section 12.71 for the definition of Qt used in the following lemma. 
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Lemma 7.2. The evaluation map 

ev M ,_i : W v — » S 3 x5 3 x5 3 
u i — ► (u(l),u(i),u(-l)) 

is transverse to Qo x Qi x Q2; where Qk '■— (p(Qk)- 

Proof. Observe that W v = evj~^_ 1 ((5o x Qi x Q2), so we need to investigate the 
differential of the evaluation map at points u £ A1 v (t) C W v . This differen- 
tial sends a tangent vector 77 to the tangent vector (77(1), 77(1), r/(— 1)) at the point 
(u(l), u(t), tt(-l)) eS 3 xS 3 xS 3 . 

Let TTi*, fe = 0, 1,2, r e (— e,e), be the 1-parameter family of Mobius transfor- 
mations of D uniquely determined by the condition that the marked point i k be 
sent to e lT i k and the other two marked points be fixed. 

Recall that, by the maximum principle, u\go is an embedding transverse to the 
characteristic foliation St. The leaf i\ of this characteristic foliation is given by the 
transverse intersection of S l and the disc Qk- This implies that u\oo is transverse 
to Q k . 

For each k £ {0, 1, 2}, the tangent vector r\ k := (d/dr)| T=0 (u o m k ) maps, under 
the differential of the evaluation map, to (n k (1) , n k (i) , r] k (~1)) . By what we just 
said, the tangent vector n k {i k ) is transverse to Qk C S 3 , and n k (i l ) — for I 7^ k. 
This proves the lemma. D 

We now begin with a systematic description of the analytic setting in which 
we want to formulate the transversality results that will imply Proposition 17.11 
We work with discs of all levels in (—1,1), although it would suffice to restrict to a 
slightly smaller interval, since Bishop discs of level t with \t\ > \/l — S are standard. 

By the Sobolev embedding theorem, each equivalence class of maps of class 
W 1 '? has a continuous representative for 1 ■ p > dimR© = 2. We think of this 
representative when we speak simply of a map of class W 1 ' 13 . This allows us to 
introduce the following spaces. 

Notation. Let C C W 1 ' p (B,M i ) be the subset of maps u: (D, 9B) -* (M 4 , S* 3 \ K) 
such that u \ go : 9D —t S 3 \ K ~ S 1 lies in the homotopy class of Mglan; where u® 
is one of the standard Bishop discs. 

Let B C C be the subset of maps u such that u|aB maps dU> into a punctured 
sphere S f \ {q±} for some t £ (— 1, 1). 

Observe that W v = {u £ B: du = 0}. Contrary to appearances, the equation 
du = on B is not a linear one, since the boundary conditions are not linear. 

Proposition 7.3. The spaces B and C are Banach manifolds. 

Proof. We begin with C. Here we just give the main points; for more details see [32] . 
First we consider VF^-maps u: (D, 31D>) — > (IR 4 ,^ 3 ). Choose a metric on R 4 for 
which S 3 is totally geodesic. Then the composition of a section of u*(TK. 4 ,TS 3 ) 
with the exponential map for this metric will be a map (B, 9D) — !• (R 4 ,.? 3 ). This 
can be used to show that we obtain a Banach manifold modelled on the tangent 
space T U C of W^-sections of u*(TR 4 , TS 3 ). 

Given u £ C, the set S 3 \ K is an open neighbourhood of u{dW) in S 3 . It then 
follows from the Sobolev embedding theorem that the space of maps (B, 9B) — > 
(R 4 , S 3 \ K) is an open subset of the previous Banach manifold. The space C, where 
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we impose an additional homotopical condition, consists of a connected component 
of this open subset. 

We now turn to B. Recall that the 2-spheres S* C S 3 are the level sets of the 
function H , given by projection onto the 7/2-coordinate. Write H := H o p^ 1 ; then 
the spheres S l = ^(S*) are the level sets of H . This means that B is the inverse 
image of the set of constant functions under the smooth map 

tf: C — ► C°(dB, (-1,1)) 
u i — ► Hou\ m . 

(Beware that a VF fc,p -function loses derivatives, in general, when restricted to the 
boundary, see [221 p. 522].) 

In order to prove that B is a submanifold of C, we need to verify that \& is 
transverse to the subspace of constant functions in C°(<9B, (—1,1)). Write R for 
the tangent space to this subspace. Then the condition to be verified is that for 
u € B the composite map 

*': T u e^T nu) C°(dID,(-l,l)) = C°(SD,R) — > C°(dB,R)/R 
is surjective and its kernel splits, cf. [19 , Proposition II.2.4]. 

(i) We show that T u ^ is surjective (for all u 6 C); this obviously implies sur- 
jectivity of V&'. Choose an auxiliary Riemannian metric (.,.) on S 3 . Then the 
differential T u ^> sends a tangent vector 77 G T U C to 

Tu^iv) = (Vi?ou| aD ,77| aD ), 

which is a real- valued function on dD. 

Let /o G C°(<9B,R) be given. Poisson's formula for the ball, cf. [§j, gives the 
unique solution 770 of the boundary value problem 

\7 J-f 
(P) A?7o = 0, 77o|a D = /o-- — ^— ° u\ao, 

\\vH\\ J 

with 770 : B — >• R 4 continuous on D and smooth in the interior of D. Then 770 G r u C 
and T„*(?7o) = /o- 

(ii) Define "H C T U C as the space of solutions of the boundary value problem (P) 
corresponding to functions /o G C°(9B,R) with /o(l) = 0. We claim that H is 
complementary to ker^' in the sense of Banach spaces, i.e. H is closed in T U C, and 
ker^'(SH = T u C. 

Given 77 G T U C, define / = T„*(?7) G C°(dB,R). Let 770 G % be the solution 
of (P) for /o = / — /(I)- Then 77 obviously decomposes as 77 = (77 — 770) + 770, and 
77 — 77 G ker \&', since 

T„vI>(77-77o) = /-(/-/(!)) = /(I) 

is a constant function. This shows that T u ^ = ker 'J' + H. 

If 77 G kerVP' n "H, then T u ^/(r]) is a constant function on 9B which takes the 
value at 1, so it is identically zero. By the uniqueness of the solution of (P) this 
forces 770 = 0. So we have a direct sum decomposition oi T^. 

It remains to show that H is a closed subspace of T M C. Let (77,,) be a sequence 
in H converging in the W 1,p -norin to some 770 G T U C. By elliptic regularity [221 
Theorem B.3.1], (rj v ) is a W^-Cauchy sequence on the interior of B, and by the 
Sobolev embedding theorem a CjQ C -Cauchy sequence for all k G N. It follows that 
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rjo is continuous on D, smooth on the interior of D, and it solves the boundary value 
problem (P) for a suitable / £ C°(<9ID), K) with / (1) = 0. This means that 770 € "H, 
so H is closed. □ 

In order to prove that W^ cBisa 5-dimensional submanifold, we need to show 
that the linearisation D u of d at u £ W v C B is a surjective Fredholm operator with 
a 5-dimensional kernel. (The splitting of ker D u is then given by a closed subspace 
isomorphic to the image of D M .) For the proof of Proposition 15.21 we also have to 
exhibit explicit generators of this kernel. Since these are local statements, we first 
introduce a suitable local chart around a given Bishop disc u £ W v cKof level t. 
For an open neighbourhood U of u(D) in C 2 , the set {w £ B: w(B>) C U} is, by the 
Sobolev embedding theorem, an open neighbourhood of u in B. Our aim is to find 
a neighbourhood U and a chart 1 : U — > C x C adapted to our Fredholm problem. 

To this end, choose a frame ei, e 2 of TS 1 along u(dB>) with e\ tangent to u(9B), 
with ti tangent to the characteristic foliation Sj, and with J^ei — which is tangent 

to S 3 but transverse to S 1 * — pointing in the direction of increasing H. Choose 
a Riemannian metric on R which makes ei, Joe±,e2, Jq&2 an orthonormal frame 
along u(dU)). Set e' 2 = e 2 /||ViJ||, where the gradient and norm are taken relative 
to the restriction of the chosen metric to S 3 . Notice that J e' 2 equals V7?/||Vi/|| 2 , 
since both vector fields are tangent to S 3 , orthogonal to S l pointing in the same 
direction, and have the same length. We may extend u to an embedding defined on 
a small neighbourhood of D in C Then we find an open neighbourhood of D x {0} 
in C x C on which the map 

C x C — ► C 2 

(z 1 ,x 2 + iy 2 ) ' — ► exp u(zi) (x2e 2 + y2Joe 2 ) 

defines an embedding. Let t be the inverse map, defined on the image of that embed- 
ding. We summarise the properties of 1, which are obvious from the construction, 
in the following lemma. 

Lemma 7.4. There is a neighbourhood U of u(D) C K 4 and an embedding l: U —$■ 
C x C with the following properties: 

(1) c(u(z)) = (z, 0) for all zeD. 

(2) i*J := Tlo J oTr 1 = J on TK 4 | Dx { }. 

(3) For each z £ <9B, the differential T u ( z )t sends the tangent direction along 
the leaf of the characteristic foliation St through u(z) to {0} xlc T( z0 )C 2 . 

(4) For each z £ dD, the differential T u(z) i sends V-ff/||VH|| 2 to (0,z) C 
T( Z , )C 2 . ' D 

Thanks to this lemma we may now assume that u is the inclusion D^Dx {0} C 
C x C, at the cost of replacing Jq by a complex structure J that varies from point 
to point, but coincides with J along D x {0}. 

So the linearisation D u of the Cauchy-Riemann operator dj = d x + Jd y at the 
point u £ W v C B is given by 

D u : T U B — > _ LP(B,M 4 ) 

7] 1 — ► di-j + (d v J)d y u. 

This is a real linear Cauchy-Riemann operator in the sense of |22[ Appendix C] 
and so is Fredholm. 
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Proposition 7.5. The operator D u is onto and has a 5 -dimensional kernel. This 
kernel is spanned by du(aut(D)) , where out(D) denotes the 3- dimensional Lie algebra 
of infinitesimal holomorphic automorphisms of D, and two smooth sections a, r of 
u*iT < C 2 ,TS t ) and u*(TC 2 , TS 3 ), respectively, with the following properties: 

(1) For z G {1, i, — 1}, the vector a (z) € T u ^S l , is tangent to the characteristic 
foliation St. 

(2) The component of the vector field r|an orthogonal to TS* is a positive con- 
stant multiple o/ Viy/|| ViJ || 2 . Foru £ W C W we may assume in addition 
that for k = 0, 1,2 the vector r(i ) E T u ^k\S 3 is tangent to Qk- 

(3) For each z£l the vectors cr(z) and t(z) span the complement of du(T z TS>) 
inT uiz) R 4 . 

Remarks. (1) The notation a and r is meant to be mnemonic. The component 
of the vector field cr|aD orthogonal to u(dD) points along a leaf of the character- 
istic foliation St in the direction of the positive s-parameter; the vector field t\qd 
corresponds to an infinitesimal shift of i-level spheres. 

(2) Conditions (1) and (2) in the proposition guarantee that for u € W the 
tangent vectors a and r lie in the kernel of the differential of the evaluation map 
evi^-i, since 

T„ev M _i(?7) = (7](l),n(i),n(-l)). 

So a and r may be regarded as elements of the tangent space T U W '. 



Proof of Proposition \ 7. 5\ In our local model around u, the elements n of the tangent 
space T U B can be written as sections of the form (yy 11 1 r) ± ) e W 1,P (B,C C). The 
condition that n(z) be tangent to S 3 for z e <9B translates into 

rv^{z) € izR for z € dB. 

The real part of n L (z) for z £ 9D corresponds to the tangential direction of a leaf of 
the characteristic foliation St; the imaginary part, to VH/\\\7H\\ 2 . The flow of the 

latter vector field preserves the level spheres of H . Since the Banach manifold B 
consists of discs whose boundary lies in such a level sphere, we have the boundary 
condition 

fonr) (z) = const, for z <G <9B. 
Our real linear Cauchy-Riemann operator D u now takes the form 
D u : T U B — > I"(D,C_ffiC) 

where A and -B are smooth maps from D into the real endomorphisms of C = R 2 . 
Elliptic regularity once again gives us smoothness of the tangent vectors that lie in 
kei D u cT u B. 

(i) When the boundary condition on -q 1 - is strengthened to Im i]- 1 = for z € c© 
— this corresponds to tangent vectors along the submanifold of B of maps of level t 
— , the vector (7?" (z),r] ± (z)) lies in the totally real subspace izW © K C C © C for 
each z £ 3D. This means we are dealing with a Riemann-Roch boundary value 
problem in the sense of 22, p. 544]. The Maslov index of the bundle pair describing 
the boundary condition equals 2, so by [22J Theorem C.1.10] the operator D u is a 
surjective Fredholm operator of index 4. A fortiori, D u is onto when the boundary 
condition is relaxed to hxi r](z) — const, for z G <9D. 
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(ii) We first consider tangent vectors of the form (r)" , 0). Such a tangent vector 
lies in kerZ3 u precisely when 77" is holomorphic. By expanding 77" into a power 
series on D, it is easy to see that the condition 77" (z) £ izM. for z £ <9B is equivalent 
to 77" being of the form 

77" (z) = a + ibz — az 

for some a £ C and b £ R, cf. [14, p. 75]. It is a nice calculation to verify that 
these are indeed the infinitesimal Mobius transformations of D. In other words, 
the subspace of kerD u of elements of the form (77", 0) is the 3-dimensional space 
out(D) x {0}. 

(iii) If (r]",r]~ L ) is an element of kerD u , then in particular (8 + B)^ 1 - = 0, and 
for all z £ dD we have 77 (z) — ci for some real constant c; we call such an rj 1 - 
admissible. Any admissible 77^ can be written as 77 = w+ci, where w is a smooth 
solution of the Riemann problem 

(Re) (3 + B)w = -cBi, w{z) eRforze 3D. 

(Beware that on the right-hand side of this equation, B is a real endomorphism 
acting on the vector i £ C = R 2 .) Conversely, any solution w of (R c ) gives rise to 
an admissible 77 = w + ci. 

The boundary condition in (R c ) defines a bundle pair of Maslov index 0, so by 
the Riemann-Roch theorem [22J Theorem C.1.10] the operator 3 + B is a surjective 
Fredholm operator of index 1. Choose a non-trivial solution wq of (Ro) and any 
solution w\ of (Ri). If w c is a solution of (R c ), then w c — cw\ is a solution of (Ro). 
It follows that the space of pairs (w c , c) with w c a solution of (R c ) is 2- dimensional, 
spanned by (wq,0) and (tui,1). We conclude that the space of admissible 77 is 
spanned by 77^ := wo and 77^ := w 1 + i. 

(iv) Next we want to show that any admissible i] 1 - is in fact the component of 
a tangent vector (r]\r] ± ) that lies in ker£>„. Thus, given an admissible r] X , we 
are now looking for a solution 77" : D — >• C of the equation 977" = —A-q^, with 
77" (z) £ izM. for all z £ 83. This boundary condition defines a bundle pair of 
Maslov index 2, so again by the Riemann-Roch theorem [22J, Theorem C.1.10] the 
operator 8 is a surjective Fredholm operator of index 3. By (ii) we find a unique 
solution 77" with prescribed values in izM. at the points z = l,i, — 1. 

Starting with 77^ or 77^ from (iii) we obtain the tangent vectors a and r that 
satisfy conditions (1) and (2), respectively, of the proposition. The 77 "-component 
of a has to vanish at the points 1, i, — 1, since the real part of r] 1 - corresponds to the 
direction of the characteristic foliation St along 8H; for u £ W the 77 "-component 

77^ of t is determined by the condition 

vl(i k )+viii k )^T u(ik) Q k , k = 0,1,2. 

(v) It remains to verify condition (3). For this we need to show that an K- linear 
combination A0W0 + -^1 ( u 'i + i) that vanishes at some point in D has to be trivial. 
The 1-dimensional Carleman similarity principle [131 Corollary 13] guarantees that 
any solution 77^ of the equation (S + B)^ 1 - = can be written in the form rj 1 - = e 9 f, 
where /, g : D — > C are continuous functions with / holomorphic in the interior of D 
and g(8U)) c R. Now write AoWq + Xi(wi + i) in this form. We need to show that 
Aq = Ai = if / has any zeros. 



22 HANSJORG GEIGES AND KAI ZEHMISCH 

Suppose there is a point zq £ V) with f(zo) = 0. If zq £ <9B, then Ai = 0. If zq 
lies in the interior of D, we may assume after a Mobius transformation of D that 
zq = 0. The function /, being continuous on D and holomorphic on IntB, satisfies 
the mean value property 

= /(0) = ^- l* f{e»)d$. 

By considering the imaginary part of this equation we see once again that Ai = 0. 
This means that the function Im/, which is harmonic on the interior of H> and 
continuous on D, is zero on 9H>, and hence zero on D by the maximum principle. 
So the holomorphic function / is constant, and since it vanishes in zq it is the zero 
function, which implies Aq = 0. □ 

Proof of Proposition \5.1[ The moduli space W^ has been established as a smooth 
compact surface. So the image of the evaluation map evi will likewise be compact. 
The differential T u evi has full rank by the preceding proposition, since T u evi(^) = 
?7(1). So the image of evi will also be open in Q , which means that evi is in fact 
surjective. Injectivity of evi follows from Proposition 14.51 □ 



Proof of Provosition \5.2[ Since the bidisc D x V>i-s is a compact manifold, we need 
only show that F is an injective immersion. Injectivity follows from all Bishop 
discs being embedded and mutually disjoint. That the differential of F s has full 
rank is an immediate consequence of Proposition 17.51 (3). □ 

8. Topological intersection of discs 

In Section below we want to establish results about the positivity of intersec- 
tions of holomorphic discs that are instrumental for proving Proposition 14.51 The 
present section deals with the topological preliminaries. The main results in this 
and the following section can be found in [31] , but we feel that a more self-contained 
presentation is appropriate in our context. Some subtle issues concerning intersec- 
tion points accumulating at the boundary (see Proposition ^. ll below 1 ) have not been 
addressed in the cited paper. Intersection properties of holomorphic discs are also 
discussed in [301 Section 4.4]. 

Let W be an oriented 4-dimensional manifold with boundary M — dW, and 
£ C M an embedded oriented surface. Our aim in this section is to define an 
intersection pairing on the relative homotopy group ^(W 7 , £). Elements in this 
group are based homotopy classes of C°-maps (D, dU>) — > (W, £). 

Definition. Let V be the space of C°-maps (D,9B) -> (W,E). A disc u £ V is 
called admissible if 

(1) u is smooth, 

(2) u maps the interior of D into the interior of W, 

(3) u\go is an immersion, and 

(4) u is transverse to M. 

Write A <ZT> for the subset of admissible discs. 

Observe that in the case of W = D 4 , M = S 3 , and ScS 3 a totally real surface, 
any holomorphic disc u £ T> is admissible thanks to regularity and the maximum 
principle. 
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The following is a consequence of standard approximation and general position 
results in differential topology, see [321 Chapter 2] . 

Proposition 8.1. The subset of admissible discs A is dense in T>. □ 

The set of all intersection points of two discs u\, u 2 € T> is 

S(u 1 ,u 2 ) := {(z 1: z 2 ) e D x B: Ui(zi) = u 2 (z 2 )}. 

For the purpose of defining an intersection number of u\ and u 2 we distinguish the 
following subsets of S(ui,u 2 ). 

Definition. The set of interior intersection points of U\,U2 € T> is 

S\nt{ui,u 2 ) := S{u x ,u 2 )r\ (intB x IntB). 

The set of boundary intersection points is 

S d (ui,u 2 ) :=S , (ui,u 2 )n(9Bx <9B). 

By the requirement that an admissible disc map Int D to the interior of W, there 
can be no mixed intersection points, hence 

S(ui,u 2 ) = Si nt {ui,u 2 ) US , o(mi,m 2 ). 

Definition. We say that two admissible discs u\,u 2 e A intersect nicely if 

S(ui,u 2 ) is a finite set of isolated intersection points, and if for any (21,2:2) G 
Sg(ui,u 2 ) the two 2-planes 

d Ul {T Zl B), du 2 {T Z2 V>) c T Ul{zi) W = T U2{Z2) W 

either are transverse or coincide. Write V C A x A for the set of pairs of nicely 
intersecting discs. 

Notice that the second condition of this definition is automatically satisfied for 
any two holomorphic discs. 

Proposition 8.2. The set V of pairs of nicely intersecting discs is dense in T> x T>. 



Proof. By Proposition ^. H it suffices to show that V is dense in .Ax .4. Let m, u 2 € A 
be given. We are going to find an arbitrarily C°°-small isotopic perturbation of u\ 
to a new disc intersecting u 2 nicely. The necessary differential topological methods 
are supplied by [12] . 

In a first step, we find an arbitrarily C°°-small homotopy of mi|od to an immer- 
sion transverse to u 2 \ga. By sweeping out this homotopy over a collar neighbour- 
hood of M in W, this extends to a homotopy of u\. We continue to write u\ for 
the new map after this and each of the following perturbations. 

So after this first step the maps ui|an and M2I9D are transverse immersions of the 
circle into S, but u\ and u 2 may fail to be transverse along dD. This, however, can 
only happen at one of the finitely many intersection points of u±(dD) and u 2 (<9B). 
In a collar neighbourhood (— e, 0] x M of M in W one can define a C°°-small 
perturbation of Ui, supported in this collar neighbourhood and relative to 9B, in a 
direction transverse to S C M. 

After this second perturbation we may assume that u\ and u 2 are transverse 
near <9IB). Then Si n t(ui, u 2 ) does not have any accumulation points at the boundary, 
and a further perturbation of u\ outside a neighbourhood of 9B in B gives us a 
nicely intersecting pair (ui,u 2 ). □ 
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Remark. The proof shows that even the subset of pairs (1*1,142) with transverse 
intersections only is dense in A x A. 

As we shall see in the next section, two distinct holomorphic discs will intersect 
nicely, but not — a priori — transversely. For this reason, the above proposition 
has been phrased in terms of nicely intersecting discs. We now wish to define 
an intersection number for such discs by assigning an intersection multiplicity to 
each isolated intersection point, cf. [21] . [22| Definition E.2.1]. We distinguish 
three types of intersections, where in each case we write p := ui(zi) = 1*2(22) for 
(21,-22) € 5(1*1, 1*2). 

(i) Interior intersections. Given (21,22) G Sint (1*1,1*2), we define the intersection 
multiplicity (1*1 • i*2)( 2l .z 2 ) as in the cited references. Choose neighbourhoods of p 
in 1*1 (D) and 112(D) that intersect in p only. Then perturb the discs inside these 
neighbourhoods, keeping either neighbourhood disjoint from the boundary of the 
other, so as to achieve transversality of intersection. Then count the intersection 
points with sign according to the orientation. Notice that for the definition of this 
intersection multiplicity the maps 1*1 and 1*2 need not be smooth; it suffices that 
the intersection points be isolated. 

(ii) Transverse boundary intersections. Here the intersection multiplicity can be 
defined as for an interior transverse intersection point. If (21,22) € 5,9(1*1,1*2) is a 
transverse intersection point, the Whitney sum 

di*i(T Zl B) e du2(T, a B) 

is isomorphic to T p W. The intersection multiplicity (1*1 • i*2)( 2 i,z 2 ) equals ±1, de- 
pending on whether this isomorphism is orientation preserving or reversing. 

(iii) Tangential boundary intersections (cf. |31j . [30]). Let (21,22) € Sq{u\,U2) 
be a tangential intersection point, i.e. di*i(T Zl D) = du2(T Z2 J}). Since 1*1 and 1*2 
are transverse to M from the same side, we can write 142(D) locally as a graph 
over 1*1 (D). It is not possible, in general, to give a homotopy- invariant definition 
of an intersection multiplicity when this 'same side' condition is dropped, see [301 
Example 4.4.7]. We write the local model for 

p e S c M = dW c W 

as 

OelxRcIxCcixC. 

(This complex model is chosen for convenience of notation only; we do not consider 
holomorphic discs yet.) Moreover, we may assume that the local model has been 
selected in such a way that the local description of 1*1 and 1*2 is of the form 

(H,R) — y(IxC,IxI) 

with 

U\{z) = (2,0) and 1*2(2) = (z, h(z)). 
Now extend 1*1 and 1*2 by Schwarz reflection to continuous maps 

(C,R) — >(Cx C,RxM). 

For 1*1 this simply means extending the definition 1*1(2;) = (2,0) to z G C; for 1*2 
we set 



142(2) = (z, h(z)) for Imz < 0. 
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Up to homotopy, the definition of this extension is independent of the choice of 
local coordinates, and so the intersection multiplicity (ui ■ U2)( Zl .z 2 ) is well defined 
as the intersection multiplicity in the sense of (i) of these extended maps. 

The definition of the intersection multiplicity as in (iii) can also be used for 
transverse boundary intersections, in which case it coincides with that in (ii). Just 
as in (i) , this allows us to assign a multiplicity to any isolated intersection point of 
discs that are merely continuous. 

The following example will motivate the definition of the overall intersection 
number of a nicely intersecting pair (111,112)- 

Example. Consider the maps u\(z) = (z,Q) and 112(2) = (z,z 3 ), z € H, in the 
local model just described. The intersection point (0,0) is a tangential boundary 
intersection of multiplicity 3. We can perturb the extended ui within the class of 
maps symmetric with respect to Schwarz reflection to u\(z) — (z, z 3 — e 3 ), z G C, 
with e > 0, say. Then u\ and u\ intersect in the three points eexp(27rifc/3), k — 
0, 1, 2, all of multiplicity 1. One of these intersection points still sits on the original 
boundary, now as a transverse intersection point. The two other intersection points 
correspond to each other under Schwarz reflection; only one of them lies inside the 
actual discs wi,u|, i.e. in the non-extended domain of definition H (in the local 
model) of these discs. 

Notice that we have used the condition that U\,v,2 map <9B to £ C M in an 
essential way for the extension of u\ , U2 via Schwarz reflection. The more obvious 
way of extending u\,U2 by doubling along M, which would require only that 9D 
map to M, would lead to intersection points on the boundary of multiplicity 0, and 
pairs of intersection points (related by the symmetry) of opposite multiplicity. 

Definition. The intersection number of a pair (ux, 112) G V of nicely intersecting 
admissible discs is 

Ml»U 2 :=2 ^ ( u l- u 2)(z u z 2 )+ Yl ("i ' M 2)( Z1 , Z2 ), 
Si„t(ui,« 2 ) Sg{u 1 ,u 2 ) 

where the sums are meant to be taken over the pairs (z\, Z2) of points in the sets 
S\nt(ui,U2) and Sd(ui,v,2), respectively. 

As observed by Ye |31( Lemma 7.2], this intersection number is a homotopy 
invariant. 

Proposition 8.3. The intersection number u\ • U2 depends only on the homotopy 

classes [ui], [112] £ ^(W, E). 

Our proof follows a route slightly different from Ye's. Before we turn to the 
formal argument, we illustrate by an example the kind of deformations we shall 
employ. 

Example. We can picture transverse boundary intersections in the same local 
model used above to describe tangential boundary intersections. We always take 
u\(z) — (z,0). The two maps U2(z) — (z,z) and u^z) = (z,—z) give local models 
for a transverse boundary intersection, both of the same sign, i.e. (u\ ■ M2)(o.o) = 
(ui ■ 1*2) (o,o) • However, the signs of the intersection of the boundary curves inlxK 
are different: (mi|r • U2|k)(o,o) = -(«i|r • m||r)(o,o)- 



26 HANSJORG GEIGES AND KAI ZEHMISCH 

Choose some small e > and let h 2 : Kj — > R be a smooth function with 
hi{y) — y for y <E [0,e) and /i2(j/) = —y for y > 2e, with a single further zero at 
3e/2, where /i 2 (3e/2) < 0. Then the map z = x + iy M> (z, —x + iti2{y)) coincides 
with u\ outside a small neighbourhood of the boundary, but near the boundary it 
looks like z h-» (z, —~z). 

Suppose for argument's sake that the orientations in the model had been chosen 
in such a way that {u\ ■ u^iam — 1, but (mi|r • W2k)(o,o) = ~ 1- After the described 
perturbation near the boundary, both intersections are negative. The new trans- 
verse intersection point in the interior at z = 3e«/2 is positive, so the intersection 
number u\ • u\ is not affected by this perturbation. 

This example shows the following. Let Ui,v,2 be a pair of transversely inter- 
secting admissible discs. Then we may perturb u^ near the boundary in such a 
way that the intersection number U\ • u 2 remains unchanged, but the intersection 
multiplicity at each boundary intersection point equals the intersection multiplicity 
of the boundary curves in E at that point. 

Observe that the two models where the intersection multiplicities coincide in 
the way described are z H ► (z, z) and z t— > (z, —z). In either of these models the 
imaginary part of the second coordinate equals y, which means that both models 
depict discs on the same side of S C M in a collar neighbourhood of M = dW 
in W. 

Proof of Provosition \8.3[ Let u\ and u\ be admissible discs with [u\] — [u\] £ 
TT2(W, E), either of which nicely intersects a further admissible disc U2- We need to 
show that u1 • U2 — u\ • U2- 

In case there are tangential boundary intersections, we make a small homotopic 
perturbation of u\,u\ as in the first of our examples so as to get transverse inter- 
sections only. This does not affect the intersection number. Next, we perform a 
homotopy as in the second example to adjust the signs of the boundary intersec- 
tions. We continue to write u\ : u\ for these discs. 

Since v,® and u\ are homotopic, so a fortiori are the boundary curves u^Iod and 
u\\ao- Hence 

^ (U? , W2)(zi,z 2 ) =u\\d®*U2\dB = u\\dB*U2\dn = ^ ( u l ' u 2)( Zl , Z2 ) , 
Sg(u?,u 2 ) So(u\.u 2 ) 

where • is the usual intersection product of curves on E. 

A collar neighbourhood of E in W can be written globally as (— e, 0] x M. x E. 
Given the observation before this proof, we may assume that the disc 1*2 (D) looks 
like (—£,0] x {0} x U2(<9B) in this collar, while the discs u{, j = 0,1, look like 

{{-t,t,q): te[0,e),qeu{{dD)}- 

Now the disc u\, a homotopy from u^Iod to u}|an, and the disc u\ with reversed 
orientation define a map / : S 2 — ¥ W . When we push this 2-sphere away from the 
boundary dW in the direction of the tangent vector (—1, 1, 0, 0) to (— e, 0] x K x E, 
we remove all boundary intersection points without creating any new intersections. 
So the intersection number of this 2-sphere with 1*2 equals 

Y^ ( u i- u 2)(z 1 ,z 2 )- Yl {u\-u 2 ) {zuZ2) . 

Si a t(u1,u 2 ) Si at (ul,u 2 ) 
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This intersection number is well denned homotopically (even homologically), i.e. 
there is an intersection product 

• : H 2 (W)®H 2 (W,dW) — >Z, 

and the intersection product [/] • [u2\ is given by the intersection number above, 
cf. [3J Section VI. 11]. However, from [u\] = [u\] in 7r 2 (W,£) we deduce that / is 
homotopically and hence homologically trivial, so [/] • [112] = 0. This concludes the 
proof. □ 

Remark. Two maps (B, <9B) — > (W, S) that are sufficiently C°-close are nomotopic. 
Thus, with Proposition 18. 21 we can define an intersection product on n2(W, £). 

9. POSITIVITY OF INTERSECTIONS OF HOLOMORPHIC DISCS 

We now want to apply the topological results of the preceding section to holo- 
morphic discs (B, SB) — > (D 4 , 5 1 *). For future reference we formulate the results in 
slightly greater generality. Thus, let (W, J) be a smooth almost complex 4-manifold 
with J-convex boundary dW, and £ c dW an embedded oriented surface totally 
real with respect to J . 

In the present section, by holomorphic disc we shall always mean a smooth 
non-constant J-holomorphic disc (B, <9B) — > (W,S). As we saw in the previous 
section, any holomorphic disc is admissible. Two holomorphic discs u\,U2 are 
called distinct if ui(B) ^ u 2 (D). 

Proposition 9.1. Any two distinct holomorphic discs intersect nicely. 

Proof. By the comment after the definition of 'nicely intersecting' in the previous 
section, it suffices to show that any two distinct holomorphic discs intersect in 
finitely many points only. We are going to prove the contrapositive. That is, let 
ui, U2 be two holomorphic discs for which S(ui,U2) is infinite. We have to show 
that ui(B) =u 2 (B). 

In the infinite set S(u\,U2) we can choose a non-constant sequence (zi,z 2 ), 
v <G N, converging to some point (z^z®) <E S{u\,U2). If {zijZ®) G Sint^i, - ^), 
then by the work of Micallef- White [331 Theorem 7.1] one can find neighbourhoods 
Ui E B of z", i = 1,2, with ui(U~i) — ^(t^)- We claim that the same conclusion 
holds for (zijZz) G Sg(tii,U2). Accepting this claim for the time being, we then 
see that no matter where the intersection points accumulate, the set of points in B 
that have a neighbourhood mapped by u\ into U2(B) is non-empty. This set is open 
by definition, and closed by the same argument used to show that it is non-empty. 
Hence Ui(B) C 1*2 (B). The converse inclusion holds by symmetry of the argument. 

It remains to prove the claim. As in the topological situation we may choose 
local models such that u\, U2 around zj, z!?, respectively, may be regarded as germs 
of maps 

(H, R, 0) — > (H x C, K x M, 0) 
of the form 

u\{z) — (z,0) and U2(z) = (a(z),b(z)). 

Moreover, we may assume that in this local model the almost complex structure J 
coincides with the standard structure Jo along H x {0} cixC. The assumption 
on (zj, z°) being an accumulation point of intersections translates into saying that 
b has an accumulation point of zeros in z — 0. 
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The following interpolation argument is reminiscent of the proof of the unique 
continuation theorem in [22] p. 24] . We write 

P A P 

J(a,b) - J(a,0) = -J(a,tb)dt= D 2 J (a ^ tb) (b)dt. 
Jo at Jo 

This can now be used to simplify the Cauchy-Riemann equation for u 2 . 

= d x u 2 + J(u 2 )d y u 2 = d x u 2 + J d y u 2 + (J(u 2 ) - Jo)d y u 2 

= d x u 2 + J d y u 2 + (J(a, b) - J(a, Q))d y u 2 

= d x u 2 + J d y u 2 + / D 2 J( ai tb)(b) dt ■ d y u 2 . 
Jo 

Define a smooth map B = {B\, B 2 ) from H into the real linear maps C — > C 2 by 

B z rj= l D 2 J (a{z):tb{z)) {ri)dt-d y u 2 {z). 
Jo 

Then the Cauchy-Riemann equation for u 2 decouples into two 1-dimensional equa- 
tions 

(CR a ,&) d x a + id y a + Bib = 0, 

(CRt) d x b + idyb + B 2 b = 0. 

Now one applies a relative version of the Carleman similarity principle to (CR&). 
For this principle in the absolute case see [T31 Corollary 13] (for the 1-dimensional 
situation), [TBI Section A. 6] and [221 Theorem 2.3.5]. The key to a relative version 
of this principle is the observation (Step 2 in the proof of [221 Theorem 2.3.5]) that 
B 2 may be assumed to be complex linear. Then both the solution b of our linear 
equation and the map B 2 can be extended from H to C (near 0) by Schwarz reflec- 
tion. The Carleman similarity principle (cf. Step (v) in the proof of Proposition l7.5[) 
then implies that the solution b is a pointwise complex linear transformation of a 
holomorphic function. The identity theorem applied to this holomorphic function 
yields that this function, and hence 6, is identically zero. □ 

Remark. Relative versions of the Carleman similarity principle have been men- 
tioned previously in [20[ Proposition 3.1] and [JJ Lemma 3.1]. 

In the following theorem we use | . | to denote the cardinality of a finite set. 

Theorem 9.2 (Positivity of intersections). The intersection number of distinct 
holomorphic discs u\,u 2 satisfies the inequality 

Mi uu 2 > 2|S , i nt (ui,u 2 )| + \Sd(ui,u 2 )\, 

with equality if and only if all intersections are transverse. In particular, wi(B) and 
U20D>) are disjoint if and only if u\* u 2 =0. 

Proof. We need to show that the intersection multiplicities {u\-u 2 )i ZlZ2 \ are greater 
than or equal to 1, with equality precisely in the case of a transverse intersec- 
tion. For interior intersection points this result is due to Micallef-White [231 The- 
orem 7.1], cf. [21 Proposition E.2.2]. 

At a transverse boundary intersection point the intersection multiplicity is 1; 
this is seen exactly as for interior transverse intersections. 

For a tangential boundary intersection point we use the local model from the 
preceding proof. Thus, write u\ and u 2 near the intersection point (zi,z 2 ) — 
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(0, 0) £ HI x H as u\(z) = (z,0) and U2(z) = (a(z),b(z)), where a and b satisfy the 
linear Cauchy-Riemann equations (CR a ,b) and (CR&), respectively. The condition 
that the intersection point be tangential means that dbo = 0. Then dao =/= 0, 
since boundary points are non-singular. The relative Carleman similarity principle, 
applied to (CR&), tells us that b is of the form b(z) = b^z k + o(|z fe |) with k > 2 and 
=/= bk G C; this follows from the observation that the pointwise complex linear 
transformation from b to a holomorphic function may be taken as the identity at 
z = by incorporating the transformation at as a multiplicative constant into 
the holomorphic function. 

Moreover, from (CR a ) we see with 6(0) = that the differential dao is complex 
linear. It follows that the intersection multiplicity is k > 2. □ 

We now want to use these results to investigate the self-intersections of holomor- 
phic discs. This will yield the criterion for a holomorphic disc to be embedded that 
we need for the proof of Proposition 14.51 



Definition. Let A 6 ^(W 7 , S) be a relative homotopy class, represented by a C°- 
map u: (B,3B) — > (W, £). By Section El the intersection number A • A is well 
defined. Write /j,(A) for the Maslov index of the bundle pair (u*TW, (u\od)*T'E) 
over (B, 9B). The embedding defect of A is 

D(A) := A • A - /j,(A) + 2. 

The set of self-intersection points of a holomorphic disc u is 

S(u) := {(z u z 2 ) eDxD: u{z x ) - u(z 2 ), Z\ ^ z 2 }. 

Notice that self- intersection points come in pairs (z\, z<£), {z 2 , Z\). As in Section[8]we 
can speak of the interior and the boundary self- intersection points. Write Crit(u) C 
D for the set of critical points of u. Since u is admissible, Crit(w) is contained in Int D 
and does not have any accumulation points on <9B. Hence, Crit(u) is a finite set 
by [22[ Lemma 2.4.1], according to which critical points cannot accumulate at an 
interior point. 

Before we can prove an estimate on the embedding defect of a holomorphic disc 
in terms of the disc's self-intersections and critical points, we need a formula that 
allows us to compute the Maslov index of a bundle pair of complex rank 1 from the 
self-intersection of the zero section, analogous to the formula for first Chern class 
of a complex line bundle, cf. [3TJ Proposition 2.7]. We write ord z s for the order of 
an isolated zero z of a continuous bundle section s, that is, the intersection index 
with the zero section. 

Lemma 9.3. Let (E,F) be a complex rank 1 bundle pair over (D, <9B). Let s be 
a continuous section of (E, F) with isolated zeros. Then the Maslov index of the 
bundle pair is given by 

H(E, F) = 2 ^ ord z s + J^ ord z s. 

2£lntD zGdB 

Proof We use a doubling argument (based on Schwarz reflection) as in [151 p. 157]. 
Write B for the disc D with reversed orientation. We double B to the Riemann 
sphere S 2 — B Uod B. Reflection in <9B defines an anti-holomorphic involution 
onS 2 . 
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The double of E is constructed similarly. Write E for the complex line bundle 
over D whose fibre E-% over ~z S KJ is E z with the conjugate complex structure. The 
gluing of E z with E— over z g <9B is effected by the anti-complex involution in F z . 

Then the section s doubles to a continuous section s U s of the complex line 
bundle E U E over S 2 , with the total order of zeros ord(s U s) given by the right- 
hand side of the equation in the lemma. Now the composition formula for the 
Maslov index [221 Theorem C.3.5] and the relation of the Maslov index with the 
Chern class [H Theorem C.3.6] yield 

2/j,(E,F) = n(E,F)+n(E,F) = /i(£ul,0) 
= 2( Cl (E(JE),[S 2 }) = 2ord(sUs). 

The claimed formula for the Maslov index follows. □ 

The following theorem is a quantitative version of 31, Theorem 7.6]; cf. [23j 
Theorem 7.3] for the adjunction inequality in the absolute case. Recall the definition 
of a simple disc from Section |4] 

Theorem 9.4 (Relative adjunction inequality). Let u be a simple holomorphic disc. 
Then the set S(u) of self-intersections is finite, and the embedding defect satisfies 

D([u}) > 2\S lnt (u)\ + \S d (u)\ + 4| Crit(«)|. 

For Crit(u) = we have equality in this formula if and only if all self-intersections 
are transverse. In particular, u is an embedding if and only if D([u\) = 0. 

Proof. The finiteness of S(u) is shown as in the proof of Proposition 19.1) here the 
assumption that u be simple is used. 

A lemma of Frauenf elder [9], cf. [221 Lemma 4.3.3] allows us to choose a Rie- 
mannian metric on W for which J is an orthogonal endomorphism field, and for 
which the totally real submanifold £ is totally geodesic. 

First case: Crit(w) is empty. Then (du(TD), du(TdU>)) is a subbundle pair of 
(u*TW, (u\gv,)*TT,) of Maslov index 2. By our choice of metric, the orthogonal 
complement of this subbundle pair is again a subbundle pair; write fx for its 
Maslov index. Then 

D([u}) — u • u — /! . 

Choose a smooth section s of this orthogonal bundle transverse to the zero 
section. We may assume that the zeros of s are disjoint from the non-injective 
points of u, i.e. the points z £ ID for which the preimage u~ 1 (u(z)) contains a point 
other than z. For s sufficiently small, an admissible disc u-i nicely intersecting 
u\ := u can be defined by U2 '■— exp ni s. By the homotopy invariance of the 
intersection number we have u • u — U\ • u<i. The intersection number u\ • u-i is 
given as a sum over intersection multiplicities, with interior intersections counting 
twice. Diagonal intersection points (z, z) of u\ and u-i correspond exactly to the 
zeros of the section s. By the preceding lemma it follows that the contribution of 
these diagonal intersection points to u\ • u^ equals /i^. 

We claim that the contribution of the non-diagonal intersection points to u\ • u^ 
equals the sum over the multiplicities of the self-intersection points of u (with in- 
terior points counted twice). These non-diagonal intersection points arise in pairs 
(z' x ,z'y), (z", z'2) corresponding to a pair of self-intersection points (z±, z-i), (Z2, z\) e 
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S(u). So the two sums of intersection multiplicities are indeed equal. This discus- 
sion can be summarised in the formula 

Si„t(u) Sa(u) 

From here the argument concludes as in the proof of Theorem 19.21 

Second case: Crit(w) is non-empty. Here one can use a perturbation argument 
to turn critical points into self-intersections, see [351 Proposition E.2.4]. A critical 
point z of a holomorphic disc u is said to be of order k e N if at z the fc-jet of 
du is the lowest order non-vanishing jet. For instance, a critical point of order 1 is 
characterised by the non-vanishing of the Hessian of u. As shown in [22j p. 610] 
and [23l Theorem 7.3], each critical point of order k gives rise to at least k(k + 1) 
intersection points. Since all critical points lie in the interior, these intersection 
points are counted twice in the intersection product, i.e. the contribution to the 
embedding defect is 2k(k + 1). 

If a critical point happens to be an intersection point of the original disc, we may 
think of the situation near this point as the intersection of two local discs. When we 
perturb and desingularise these two local discs, we obtain transverse intersections of 
the two local discs and self-intersections. The former contribute to the intersection 
product according to the intersection index, the latter contribute as analysed in the 
foregoing paragraph. □ 
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